Rhythmic organization and signal characteristics of speech
نویسنده
چکیده
The Converter-Distributor (C/D) Model, a generative theory of phonetic implementation, describes an utterance as a linear string of syllables with intervening boundaries. Its base component includes phonetic status contours for voicing, tonal, and vocalic gestures. Consonantal elemental gestures, as stored impulse responses, are excited by the syllable pulse and superimposed onto the base function. A magnitude-modulated syllable-boundary pulse train constitutes a skeletal representation of the rhythmic organization of the utterance. All the temporal characteristics of the speech signal are computed based on the input specifications for each syllable by phonological features and the metrical structure, numerically augmented by prominence enhancement specified for the discourse situation, along with system parameter settings for the particular speaker in each discourse. Segmental durations in the acoustic signal vary according to syllable magnitude, not uniformly among consonants and vowels. The C/D model predicts complex patterns of such prosodic effects on segmental duration as a function of fixed threshold values for relating abstract gestures to observable durations of acoustic signals. (Supported in part by NSF and ATR)
منابع مشابه
Rhythmic variability between some asian languages: results from an automatic analysis of temporal characteristics
The rhythmic organization of speech can vary between languages. In the present research we studied rhythmic variability between Mandarin, Cantonese and Thai using automatically retrieved prosodic temporal characteristics from read speech. We measured the variability of intervals between amplitude peaks in the amplitude envelope (<10 Hz) and the durational characteristics of intervals with and w...
متن کاملبررسی برخی ویژگی های آکوستیک گفتار نوزاد مدار در مادران فارسی زبان
Introduction: When adults talk to another person, linguistic characteristics of the listener will also be considered. A clear example of speech changes depending on the listener is maternal or infant directed speech. Infant directed speech is more slowly with longer sentences and pauses at the end of the utterance. Undoubtedly the most distinctive feature of this style of speech is acoustic c...
متن کاملSpeaker idiosyncratic rhythmic features in the speech signal
Speakers' voices are to a high degree individual. In the present paper we report about an ongoing research project in which we study how temporal characteristics of human speech (e.g. segmental or prosodic timing patterns, speech rhythmic characteristics and durational patterns of voicing) contribute to speaker individuality. We report about the creation of the TEVOID-Corpus (Temporal Voice Idi...
متن کاملThe Role of Temporal Amplitude Modulations in the Political Arena: Hillary Clinton vs. Donald Trump
Speech is an acoustic signal with inherent amplitude modulations in the 1-9 Hz range. Recent models of speech perception propose that this rhythmic nature of speech is central to speech recognition. Moreover, rhythmic amplitude modulations have been shown to have beneficial effects on language processing and the subjective impression listeners have of the speaker. This study investigated the ro...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000